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Abstract. It is well known that S2 cannot prove the injective weak pigeonhole principle for 
polynomial time functions unless RSA is insecure. In this note we investigate the provability of the 
surjective (dual) weak pigeonhole principle in S2 for provably weaker function classes. 



1. Introduction 

The weak pigeonhole principle for a relation R{x, y) says that R does not represent an injective 
map from pigeons to n holes. Variants of the weak pigeonhole principle have been shown to 
be connected with cryptography and circuit lower bounds in several different ways. Krajfcekand 
Pudlak [2j have shown that if the theory 5*2 can prove the principle for graphs of p-time functions 
then the cryptographic scheme RSA is insecure. Here 5*2 is roughly a theory which has axioms for 
the symbols of arithmetic and length induction axioms for NP-predicates. The surjective (dual) 
variant of the weak pigeonhole principle states there is no surjective map from n pigeons onto 
holes. ^ Jefabek §3] has shown that the surjective weak pigeonhole principle for p-time functions 
is equivalent over ^2 to (essentially) the schema that asserts that for each fixed A: > that there is 
a string of length 2n^ that cannot be bit-recognized by any circuit of size n^. More recently, Pollett 
and Banner ^31 have shown that the multifunction weak pigeonhole principle for iterated p-time 
relations is equivalent over 52 to the existence of strings that are hard for an iterated circuit block 
recognition principle. This implies that if RSA is secure then ^2 cannot prove superpolynomial 
circuit lower bounds for multifunctions computed by iterated p-time relations. In an attempt to 
make progress towards making these contingent results non-contingent, the present note investigates 
whether there are any interesting classes of functions for which ^2 can prove the weak pigeonhole 
principle. 

Proofs of the pigeonhole principle usually start by assuming one has a map that violates the 
pigeonhole principle, then constructing a submap that also violates the pigeonhole principle and 
applying induction to get an obvious contradiction, such as an injective map of two objects into 
one. The weakest theory known to prove the weak pigeonhole principle for graphs of p-time mul- 
tifunctions is T2 , which is defined like ^2 but with usual induction for NP'^ "^-predicates. This was 
shown by Maciel et al. |H1 §6] following essentially this paradigm. The authors assume that they 
have a multifunction mapping n? pigeons to n holes. The pigeons are split into groups of size n and 
the holes into two groups of size n/2. They then argue that either (1) all of one group of pigeons 
must be mapped into the first group of holes, or (2) one can pick one pigeon from each group so 
that pigeons from different groups are mapped to different holes (all in the second group). In either 
case one gets a map from n pigeons to n/2 holes which is amplified to a map from pigeons to 
n/2 holes using the original map. This process is then iterated. The entire argument is carried 
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Some authors refer to the principle that asserts that there is no bijective map from n pigeons onto n holes as 
the onto principle; we shall not refer to this principle or use this terminology in this paper. 
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out in 5*2 , which is conservative over T| for S3 formulas (an in particular, for the weak pigeonhole 
principle) . 

In contrast to the above technique for proving the weak pigeonhole principle, in the current 
paper we use a technique that clearly illustrates the cryptographic nature of these principles. We 
consider a function algebra which is the closure of the terms of the language of 52 under 3- 
lengths bounded primitive recursion (see Definition^. Working in S2 we show that any function 
in omits values of the form [{n^n — l)/3j from its range, where x#y = 21^11^1. Pollett HHIE] 
has connected the algebras A"^ to weak theories of arithmetic, and the techniques of those papers 
can be used to show that if f{x) E A"^, then /(x) 7^ L^/3j. In this paper we prove the much 
harder statement that for any n and any a < n, f{a) 7^ [{n^n — l)/3j; in particular, f{x) is 
not a surjection from {0, . . . , n — 1} onto {0, . . . , n^n — 1} (which we will refer to as a surjection 
from n onto n^n). Our technique uses a new complexity measure that we call the prefix series for 
f{x) (Definition 2)) which might be useful in future work. It should be noted that S2 can prove 
the surjective pigeonhole principle for n onto from the principle for n onto n^n. However, the 
amount of iteration takes one (just barely) out of the class A^. 

We now discuss the organization of the rest of the paper and give a high-level sketch of the proof. 
In the next section we introduce the necessary notations from bounded arithmetic and define our 
function algebras. In Section|31we define the notion of a "prefix series." Roughly speaking, a prefix 
series for f{x) is a representation of f{x) as a difference of sums of prefixes of the values x. In 
Theorem ini we establish a bound on the length of such prefix series. In Section we convert the 
prefix series representation to one in which the prefixes are replaced by bits. We compute a bound 
on the length of such a representation and combine it with Theorem ^ to compute a bound on 
the number of times the binary representation of f{x) can alternate between and 1 fLemmas 1101 
andlllj). For f G A^ this bound is provably lower than the number of alternations in \_{n^n — 1) /3j , 
allowing us to conclude that / is not a surjection from n onto n^n (Theorem^)). We conclude 
with some remarks on generalizations and extensions. 



2. Preliminaries 

This paper assumes familiarity with the texts of either Buss Krajicek or Hajek and 
Pudlak For completeness, we review the basic notations of bounded arithmetic. The specific 
bootstrapping we are following is that of Pollett 10;; , but yields equivalent theories to the ones in 
the books just mentioned. The language L2 contains the non-logical symbols 0, S, +, •, =, <, — , 
[^x\, \x\, MSP{x,i) and #. The symbols 0, S{x) = x -|- 1, -|-, •, and < have the usual meaning. 
The intended meaning of x — y is x minus y if this is greater than zero and zero otherwise, [^xj is 
X divided by 2 rounded down, and |x[ is [log2(x -|- 1)], that is, the length of x in binary notation. 
MSP{x,i) stands for 'most significant part' and is intended to mean [x/2*J. Finally, x#?/ reads 'x 
smash y' and is intended to mean 2l^'II^L The original formulations of bounded arithmetic do not 
usually include MSP{x,i) and — , but instead define them with formulas. One advantage to our 
approach is that one can define terms in the language to do a limited amount of sequence coding, 
which allows us to more directly formulate our principles in the language L2. 

The bounded formulas of L2 are classified into hierarchies and 11^ by counting alternations 
of quantifiers, ignoring sharply-bounded quantifiers, analogous to the hierarchies and 11^ of the 
arithmetic hierarchy. Here sharply bounded means bounded by a term of the form \t\. Formally, 
a Sg (Ilg) formula is one in which all quantifiers are sharply-bounded. The (n^_^^) formulas 
contain the U H^ formulas and are closed under -lA, A^B,BAC,B\/C, sharply-bounded 
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quantification, and bounded existential (universal) quantification, where A is n^_|_x (^i+i) ^^'^ B 
and Care 

The theory BASIC is axiomatized by a finite set of quantifier- free axioms for the non-logical 
symbols of L2. IND^ consists of formulas of the form 

A{0) A {yx){A{x) A{Sx)) {yx)A{£{x)). 

for i £ T where r is collection of unary functions. Let id denote the identity function. C-IND and 
-LIND (length induction) are obtained by taking A € C and r to be {id} and {|id|}, respectively 
(we will write |id| for x |id(x)|, etc.). The theories T2 and S2 are axiomatized as BASIC together 
with respectively T.^-IND and T^^-LIND. 

We next briefly consider sequence coding and bit manipulation in our systems of arithmetic. The 
term BIT(i,w) := MSP{w,i) — 2 • lMSP{w,i)/2\ is the i-th bit of w. The ordered pair {x,y) can 
be defined as the binary string l{x)l{y) where (x) is the binary representation of x padded with O's 
on the left to have length |rE| -|- \y\ and similarly for (y). Sequences can be defined as ordered pairs 
in which the first component specifies a block size and the second a concatenation of blocks. The 
predicate Seq{s) that is true when s is the code of a sequence can be given a SQ-definition. The 
function SqBd{a, b) := 64(2#a#(2(26+ 1))) is a bound on the value of any sequence of length < 
each of whose components is < a, and (3{b, w) is defined to be the b-th. element of the sequence w. 
P{b, w) can be defined as a term in our language, and the basic properties of SqBd and P{b, w) can 
be proved using open length induction. We will use sequences of pairs extensively in this paper, so 
define the term PSqBd{a,b) = SqBd{SqBd{a,2'^),b) that is a bound on the value of any sequence 
of pairs of length < |6| for which each component of each pair is < a. 

The theory S2 can prove the existence of sequences and properties of sequences using length 
induction if particular elements in the sequence have S^-definitions. Sometimes it will be convenient 
to use other principles more directly connected to sequences. It is known that S2 can prove the 
following H I- RE PL principle (see [T] or PI): 

Vx < |6| 3y < aA{x, y) ^ 3w < SqBd{a, 2b + l)Vi < |6| {f3{i, w) < a A A{x, (3{i, w))) . 

where ^ is a S^-formula. Using this principle, we can S^-define the sequence (/(O, x), /(I, x), . . . , /(p(|x|), x)) 
where p is a polynomial provided we know /(i,x) is S^-definable (see below). Further it can be 
shown that 5*2 can prove basic properties of this sequence. The Tll-REPL scheme can be used to 
prove another useful scheme in S2, that of Ti^-COMP 

{3w < 2l"l)(Vi < \a\){A{i,a) ^ BIT{i,w) = 1). 

which allows one to get a bit-string of values for a S^-formula A{i,a). 

The IND'^ scheme is closely connected with the following type of bounded primitive recursion: 

Definition 1. {BPK^) Let r be a set of unary functions. / is defined from functions h, t and r 
by T -length bounded primitive recursion if: 

F(0,x) = <7(x) 
F{n + 1, x) = min(/i(n, x, -F(n, x)), r(n, x)) 
/(n,x) = F(^(t(n,x)),x) 

for some r,t £ L2 and (. £ t. 

Let L2 be the language of L2 where the symbol for multiplication has been replaced with 
PAD{x,y) with intended meaning x ■ 2l^L As PAD is definable with an L2-term any Lg-term 
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can be rewritten as an L2-term. Given a class of formulas ^, we say an arithmetic theory T can 
^-define a function / if there is a formula Aj in ^ such that T proves: 

(1) Thyx3lyAf{x,y) 

(2) N^yxAf{xJ{x)) 

Definition 2. For a set r of function symbols, the set A'^ is defined as follows: 

(1) The function symbols of ™ along with symbols nf for < i < n (intended 
interpretation: projections); 

(2) If f,gi,...,gr G A'^ and / is r-ary, then Cf^g^^,,,^g^ E A'^ (intended interpretation: the 
composition of / with gi, . . . ,gr); 

(3) If (7, /i G A'^, t,r G , and £ £ t, then Rg,h,t,r,e £ (intended interpretation: the function 
defined by ^-bounded primitive recursion from g, h, t, and r). 

A'^ of course corresponds to a function algebra and we shall frequently informally refer to it 
as such. We write A^ for A'tl^'^lmJ'; we shall focus primarily on these classes in all but the last 
section. Pollett J9| considers these classes where the initial functions also include multiplication. 
In particular, it is known that A^ corresponds to the polynomial time functions and Pollett shows 
that A'^ C A^ as A^ cannot define [a^/Sj. When we refer to terms (formulas, etc.) over A"^ in, e.g., 
5*2, we assume that the functions in r are defined by terms and that the defining axioms of 
the functions symbols are (conservatively) added to the theory (we shall always have A"^ CI A^). 
Using the close connection between LIND and BPR^^^'^^^ Buss ^ shows that the functions in A^ 
are precisely the functions S^-defined in ^2 . 

A couple of notations that we use frequently in this paper are: 

• For X = xi, . . . , Xk, X < n abbreviates xi < n A . . . A < n. 

• We will write #''(n) for n# . . . #n (6 — 1 #'s). 
Definition 3. 

(1) For a unary function symbol /, sPHP(f)^ is the formula n < m A3y < rriix < nf{x) ^ y. 

(2) The weak surjective pigeonhole principle for /, sWPHP{f), is the sentence \/n.sPHP{f)^ . 
If ^ is a set of function symbols, sWPHP{A) is the set of formulas sWPHP{f) for unary 
functions f € A. 

(3) The sentence sWPHP*{f) is Vn.s PrPi7P;^#"(/) and sWPHP*{A) is defined similarly. 

Proposition 1. If A is a set of function (symbols) closed under BPR^^^^'^^^^ then S2 \- s WPHP"^{A) - 
sWPHP{A). 

Proof. If /o is a surjection from 2l™l onto 2^l"^l, define surjections fr from 21"*! onto 2^'^l'"l by 
setting fr+i{x) to be the result of replacing each length-|m| block y of /r(x) by /o(y)- Then f\\m\\ 
is a surjection from 2'™l onto 2l'"ll™L □ 

3. Prefix series representation 

In this section we introduce the notion of a prefix series, which is our main technical tool for 
proving the weak surjective pigeonhole principle. 

Definition 4. 

(1) A prefix series for M from m of width w and length k is a pair {P,N) of sequences such 
that: 

(a) P = {{ao, bo),..., {okp-i, bkp-i)) and N = {{cq, do),..., (cfc^^.i, 4j^_i)); 

(b) M = Z■:o~'»^^''-E■^o'c^^'-, 
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(c) For all i, bi,di < {wl; 

(d) k = kp + kN] 

(e) For all i, either Oj = 1 or there are j and y < \mj\ such that Oj = MSP{mj,y) and 
similarly for Cj. 

(2) A bit series for M from rh of width w and length k is a prefix series for M from in of 
width w and length k in which all a^'s and Cj's are 1. 

(3) For terms t{x) and w{n) let kt^w{n) be the least k such that if mj < n for all i, then there is a 
prefix series for t{m) from m of width < w{n) and length < k. Then kt^w is the w-summand 
complexity of t {kt^w{n) may not be defined for all w). 

For any function /, if we could define the term w{n) = max{|/(x)| : x < n}, then w{n) itself 
would be a bound on kf(^^^^^: just use the binary representation of f{x) to define a bit-series. Of 
course, such a term w is problematic; our first goal is to show that for every f € there is in fact 
a w such that A;/{x),io bas a "tractable" upper bound (and in particular is defined). 
Definition 5. 

(1) PfxSeries{S, y,xi, . . . , Xr, w, k, 6) is the predicate 

S < PSqBd{xi + • • • + Xp + lu-j , 2'^''^(^'l^l)) A Vi < min(A;, \6\) 

r 

3a,b< (3{i,S)(j5{i,S) = (a, 6) A (a = 1 V \/ (3r < \xj\ {a = MSP{xj,r)))^ A 6 < A 

eval{S) = y^ 

that states that S is a prefix series for y from x of width w and length min(A;, \5\). Here 
eval is the polynomial-time function that on input (P, N) as in Definition outputs 
Ei=o~^ai2''« ^ Ef'fo"^ ^^2-^'. Note that 35.P/x5eries(S, y, f , u;, A;, <5) is a formula. We 
discuss the parameter 5 below. 

(2) Let w{n), k{n), and 5{n) be terms. PfxEound^ ^ f. g is the predicate 

3noVn > noVx < n3S.PfxSeries{S,t{x),x,w{n),k{n),d{n)) 

that states that for sufficiently large n, min(/c(n), |i5(n)|) is an upper bound on kt^w{n) (and 
in particular, kf yj (n) is defined). 

(3) BitSeries{S,y,'w, k, 6) is the predicate 

Vi < min(A;, \5\)3b < \w\ (^f3{i, S) = (1, b) A eval{S) = 

that states that 5 is a bit series for y of width w and length min(/c, |(5|). BitBoundt^ui,k,s is 
defined analogously to PfxBound^gj^g. 

The point behind the parameter 5 is to ensure that the exponentiation terms in PfxSeries and 
BitSeries are bounded by L2-terms. Our goal is now the following: given an A^-function symbol /, 
find L2-terms w, k, and 5 such that S2 \~ PfxBound in other words, find a bound on the 
lengths of the prefix series for f{x). In fact, the form of k will be made explicit, and this will allow 
us to take J = for all function symbols in A"^. However, for some preliminary observations which 
do not rely on the form of /c, we must allow this parameter to vary. 

Lemma 2. S2 proves the following: 

PfxSeries{S, y , x , w , k, 6) A w < w' A k < k' ^ PfxSeries{S, y, x, w' , k' , 5). 



5< P5gSd(l + |ii;|,2'^'°(^'l^l)) 
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In particular, for any terms t, w, w' , k, k' , and 6, 

S2 \~ (3noVn > no(w(n) < w'{n) A k{n) < k'{n)) A PfxBoundi ,^ ]^^^) PfxBound^^^, f., 
and similarly for the bit-series predicates. 

Lemma 3. For every f{x) G there is an L2-term s{x) without — or MSP (hence monotone) such 
that S2 l~ Vx(/(x) < s{x)). In particular, there is a number b such that S2 l~ BnoVn > noVx < 
n{\f{x)\ < |n|^). 

Proof. The first part is proved by induction on the definition of /. The base cases are immediate 
(bound X — y and MSP{x,y) by x) and composition is handled by substitution. Suppose / is 
defined as in Definition ^ the induction hypothesis gives us bounds Ug, ut, and Ur for g, t, and r 
respectively. Then F{y,x) < Ug{x) + Ur{y — 1, x) and hence /(x, x) < Ug{x) +Ur{\ut{x,x)\ ,x). For 
the second part, prove that for any L2-term u{x) without — or MSP there is a number b such that 
|u(x)| < \nf for sufficiently large n and x < n by induction on u. For example, if n = tii#U2, then 
take 6 = 61 + 621 where 6j is the inductively- given exponent for Uj. □ 

Lemma 4. S^ h \/x3S.BitSeries{S, ). In particular, for every A^-term u{x) there is a 

number b such that S2 \~ BitBound,^ 

Proof. For the first part use T^^-REPL to construct the sequence of pairs {BIT{i,x),i) such that 
BIT{i, x) = 1, which witnesses the claim. For the second part, fix any x; then there is an S such that 
BitSeries{S,u{x),u{x), |u(x)| ,u{x)). Now take no and b as in Lemma|31and apply Lemma|3 □ 

We call the bit series given in Lemma 0] the natural bit series for x. We need the following bound 
for calculating the length of a prefix series for (the function represented by) an MS'P-term: 

Lemma 5. The following are provable in S2: for any a, any length k and any length y: 

(1) Eto MSP{a,, 1) < M5P(i:to 1) < (Sto MSP{a,, 1)) + k - 1. 

(2) Eto MSPiai,y) < MSP{Eto Eto MSP{ai,y) + Er=o MSP{k, i). 

(3) MSP{a,y) - MSP{b,y) - 1 < MSP{a ^h,y)< MSP{a,y) - MSP{b,y). 

Proposition 6. S2 proves the following: 

3S" [[PfxSeries{S, y, x, w, k, 6) A PfxSeries{S' , y', x, w' , k' , 5')) 

PfxSeries{S" , y + y' ,x,w + w' , k + k', S6')] . 
The same claim holds with y + y' replaced with y — y' . 

Proof. Working in 52 , suppose y = P — N, and y' = P' — N' are prefix series for y and y' of 
widths w and w' and lengths k and k' respectively. If > P, then y + y' = P' — N . li N < P and 
N' > P', then y + y' = P^N. U N < P and N' < P' , then y + y' = {P + P') ^ {N + N'). In each 
case, the width and length of the prefix series are at most w + w' and k + k' respectively. □ 

We shall frequently rearrange sums of differences of sums in this way to obtain prefix series; we 
will not frequently point out that we are doing so. 

Proposition 7. S2 proves the following: 

3S' [PfxSeries{S, z, x, w, k, 6) —>■ 

PfxSeries{S',MSP{z,y),x,w + \k\ + ||A;|1 , A; + [A;] + ||A;||,(5|5| \\S\\)]. 
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Proof. Suppose P — is a prefix series for z from x of width w and length k as in Definition • 
From Lemma |S1 and arithmetic we have that Q — k\k\ < MSP{P — N,y) < Q + k\k\ where 
Q = Ei=o^ MSP{ai2^^ , y)^Y!l^Q^ MSP{ci2'^^). Thus there is some e<k\k\ such that MSP{z, y) = 
Q — e or MSP{z, y) = Q + e. Since Q is a prefix series from x of width < w and length < k, by 
Proposition El and Lemma ^ there is a prefix series for MSP{z,y) from x of width w + + \\k\\ 
and length /c + + \\k\\. □ 

We now set about showing that for m > 3 and every function symbol / G A"^ there is an L2- 

.bf 

term Wf{n) and a number bf such that if k{n) is the term ||n||'"'''" then S2 \~ PfxBound ^^,^(^^-^^^(^^-^^^2. 

More precisely, we will write Unll'"'™ for the term ||n|| ^ so that k{n) is an L2-term. 

It is also easy to see that if m > 3, then ^2 proves that ||n||'"'''" is bounded by 2l"l3^ ^ which in 
turn is bounded above by |n^| for sufficiently large n (where the point at which this holds depends 
only on h). Thus from now on, we shall simply write PfxBound f^ ^j^^j^-^ j^^^j^-^ with the bounding term 
always implicitly n^. The proof is by induction on the definition of /; we separate out the base 
case into its own proposition. 

Proposition 8. If f is an L2 -function symbol, then there is an L2-term w and a number b such 
that Sn h PfxBound , ^ « nh. 

Proof. The proof is a straightforward analysis; most cases are handled by already-proved lemmas 
and propositions. If / = 0, then Wf = kf = and if / = x^y then we can take w{n) = n^n and 
k = 1 since fxy = 1 • 2l^ll^L If fx = \x\ then an argument as in Lemma 21 applies using LemmaOl 
to bound f{x) by ||n|j''. If f{x,y) = x + y or f{x,y) = x — y then Proposition El applies and if 
/(x, y) = MSP{x, y) then Proposition [3 does. If /(x, y) = PAD{x, y), then a prefix series for f{x, y) 
from x,y is given by x ■ 21^', which has width < \n\ and length 1. □ 

Theorem 9. Ifm>3 and f is an A^-function symbol then there is an L2-term Wf and a number bf 

such that S2 l~ PfxBound where k{n) = HnH'"'™ . 

Proof. The proof is by induction on the definition of /. The base case in which / is an symbol 
is handled in Proposition |H1 

Suppose / has defining equation fx = g{hix, . . . ,hrx). By the induction hypothesis we have 
terms Wh{n), kh{n), Wg{n), and kg{n) such that 5j h f\iPfxBoundj^^g^^^^kh ^ Pf^Boundgg^^^ ^,- 
Let Uh be such that for all n > there is a prefix series for hi{x) from x < n of the given 
width and length, and define rig similarly. Furthermore take a constant B such that ii n > rih 
and X < n, then \hx\ < \n\^ . Take uq large (we shall impose constraints as the proof pro- 
gresses), n > riQ, and x < n. The induction hypothesis for hi gives us a prefix series Si for hi{x) 
from X of width Wh{n) and length kh{n) (assume uq > n^). Since no > we also have that 
hi{x) < 2l"l . Now the induction hypothesis for g gives us a prefix series Sg for g{hi{x), . . . , hr{x)) 
from hi{x), . . . ,hr{x) of width Wg{2^^^ ) and length kg(2^'^^ ) (assume 21""! > Ug). The terms 
in Sg have the form MSP{hi{x),y)2^ for some i, y and j (the terms with coefficient 1 we leave 
as they are). Replace each such term with a prefix series for PAD{MSP{hi[x),y),2^~^) from x; 
this is obtained from the inductively-given prefix series by Lemma [3 and then padding, and has 
width at most Wh{n) + Wg{2^"'^ ) and length at most kfi{n) + \kh{n)\ + \\kfi{n)\\. After replacing 
all terms in Sg in this way and rearranging if necessary (dropping expressions that evaluate to 0) 
we obtain a prefix series S for g{hi{x), . . . , hr{x)) from x of width at most Wh{n) + Wg(2l"'' ) and 
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length /cg (2 1 "I ){kh{n) + \kh{n)\ + \\kh{n)\\). Finally, by taking no large enough, A;g(2l"l ) is bounded 

above by ||n||'"''" , from which an upper bound on the length of the correct form is easily obtained, 
completing the proof for this case. 

Suppose / is defined by |- [^-bounded recursion from g, h, t, and r with intermediate function F 
as in Definition n Let k'{n) = kh{n) + Take h and c such that for sufficiently large n 

and y,x < n, \F{y,x)\ < \n\ and \t{x,x)\^ < |n|^. Now take a sufficiently large no, n > no, 
X < n, and show by length-induction on y < n that there is a prefix series for F(y, x) from y, x of 
length (3A:'(2l"'l ))ykg{n). For the induction step, since F{y + l,x) is defined as a composition of h 
with F{y,x) (the case in which F(y + l,x) = r{y,x) is immediate) an argument as in the previous 
case applies. Now taking y = \t{x, x)\^ we obtain a prefix series of length (3/c'(2l"l A:g(n) which 

I .Bc + bg + l 

we can bound by a term of the form ||n|| where B = bh + 1- Similarly we obtain a bound on 
the width of the prefix series for F{y,x) of the form yw'{2^'"^ )wg{n) where w'{n) = Wh{n) + Wr{n); 
when y = \t{x,x)\^ < |n|^, we obtain an term bounded by an L2-term in n. □ 

4. Bit series representation and the weak pigeonhole principle 

We now extract bounds on lengths of bit series representations from bounds on prefix series 
representations and use them to determine bounds on the number of times the binary representation 
of f{x) can alternate between O's and I's. 

Lemma 10. For any terms t, w, k, and 5, S2 \~ PfxEound^ ^j j.^ BitBoundi y,(^n)+\n\,\n\k{n),n#S{n)- 

Proof. Given a prefix series for t{x) from x, replace each term a2^ with 1 • 2^^*^ +••• + !• 2^^''", 
where the i exactly those i such that BIT{i,a) = 1. Since each ij < \a\ < \x\i < |n| for 

some /, the resulting bit series has width at most w{n) + \n\. Since \a\ < \n\ each term is replaced 
with a summand of at most |n| terms. Since there are at most k{n) summands, the resulting bit 
series for t{x) from x has length at most |n| k{n). □ 

Given the binary expansion of a number y, a block is a substring of all O's or all I's of maximal 
length. Let denote the number of blocks in y's binary expansion. This number can be 

S^-defined in ^2 as (#i < \y\){BIT{i,y) 7^ BIT{i + l,y)). Here < \y\)B is the operator which 
counts the number of i < \y\ such that B holds. It is known to be S^-definable in ^2 provided B 
is A\ by Buss p. 

Lemma 11. S'2 proves the following: 

yw6ykys < PSqBd{i + lu-j ,2'^'°(i'=i'i^i))[ 

Bits cries' {S,eval{S),w,\k\, 6) {#B{eval{S)) < 2[A:[ + 1)] 

where BitScrics' is the part of the definition of BitSerics (Definition 00) ) in brackets. In other 
words, the binary expansion of a number represented by a bit-series of length \k\ has at most 2\k\ + l 
blocks. 

Proof. Fix w and 5; we prove this claim by length-induction on A;. If A; = then eval{S) = 
and the claim is immediate, so assume the claim is true for k and that BitSerics' {S,w,k + 
1,6). Then eval{S) = eval{S') ± 2^ for some S' satisfying S' < PSqBd{l + \w\ ^2"^^^^^,^) and 
BitSerics' {S' , eval{S'),w,k, 5), so the induction hypothesis applies to S' . It is now a matter of 
exhausting cases on whether cval{S) = eval{S') + 2-' or eval{S) = cval{S') — 2^ and BIT{j — 



THE WEAK PIGEONHOLE PRINCIPLE FOR FUNCTION CLASSES IN 



9 



1, eval{S')), BIT{j, eval{S')), and BIT{j + 1, evaliS')) to show that #B{eval{S)) < #B{eval{S')) + 

2, from which the claim follows. □ 

Theorem 12. For any f G A^, h 3noVn > nQ.sPHPl*'^[f). 

Proof. Combining Theorem |21 with Lemmas 1 1 01 and ITTI we have that for sufficiently large n, if x < n 
then i^B{f{x)) < 4|n[ j|n||'"'3 for some fixed number b. Now 5*2 proves that Inig < [|[n|| /2j for 
sufficiently large n and that [|a| /2j < \MSP{a, [|a| /2j)| for any a; combining these, we have that 

||n||l"l3 < 2I-I3''' < 2LIIHI/2J < 2lM5P(|n|,L||nl|/2j)| < 2MSP{\n\ , Lllnll /2j). 



Thus we conclude that #B{f{x)) < 8 |n| M5P(|n| , /2j). Thus < 3 + ||n|| + 

[|lni|/2j < 3 + [3||n||/2j. On the other hand, S'j proves that if a = [M^Mj then #s(a) > 
MSP{\nf - 1, 3) (first show that n#n - 1 is a string of all I's, then analyze the grade-school algo- 
rithm for division to show that |^ "-#^~^ J = 101010 . . .; this can be done with open length- induction) . 

Thus |#B(a)| > |n|^ - 1 -3 > 2||n||-3. If ||n|| > 3 then < [3||n||/2j+3 < 2||n||-3 < 

|#_B(a)|, so we conclude that a ^ f{x). □ 

Finally, we note that the value uq in Theorem 1121 can be calculated explicitly. That is, in each 
argument of this and the previous section in which the conclusion is of the form 5*2 l~ 3noVn > no ... , 
we could have instead computed a closed term and shown 5^ l~ Vn > N . . . (adding N into the 
formalism would have entailed making our already-unpleasant notation even worse). Thus we can 
improve Theorem 1121 as follows: 

Corollary 13. h sPHP*{A^). 

Proof. Fix / G A^. As just discussed, there is a closed term N such that 5^ H Vn > N .sPHP'^^{f). 
Since is a closed term, for each M < N there is an explicit proof in ^2 of sPHP^^j^^\f), and 
hence we conclude that 5j h sPHP*{f). □ 



5. Generalizations and extensions 

Analyzing the details of the above proofs, we can determine the properties of jidjg that are 
required in order to generalize the result to function classes r. The key point is that (Inig)'' G o(||?7-||): 

Theorem 14. Let r consist of unary functions i such that: 

(1) For every £ G r there is a constant N such that S2 l~ Vn > A^(jn|3^(n) < [||n|| /2j). 

(2) For every £i,i2 G r, there is £3 € t and a number N such that S2 h Vn > N{£i{n) +^2(n) < 
^3(n)). 

(3) For every £i,£2 G r, there is £3 £ t and a number N such that ^2 h Vn > A^(£i(n)^2(n) < 
£3in)). 

Then h sPHP*{A'^). 

Proof. The proofs estimating the lengths of the prefix series carry through mutatis mutandis, with 
the new bound on the length being ||n||^'-"^ for some £ € t; the second two hypotheses are used 
in the composition and r-bounded recursion cases of Theorem [HJ The proof of Theorem relies 
on the fact that ||n|jl"l3 < 2LII"II/2J. Nqw we need |ln||^(") < 2l"l3^(") < 2LII"II/2J, which is the first 
hypothesis. □ 
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Of course, we can add any functions to the algebra A'^ provided that the conclusion of Theorem|21 

still holds. In particular, if 5*2 proves that for sufficiently large n and x < n, g{x) < 2ll"ll then 
the natural bit series for g{x) satisfies the conclusion, so any such functions can be added to A'^; 
we leave it to the reader to precisely formulate the corresponding theorem. 

Clote gives several interesting function-algebra characterizations of various complexity classes. 
Most of these rely on so-called concatenation recursion on notation and one other recursion scheme. 
The function / is defined from g, ho, and hi by concatenation recursion on notation if 

f{0,x)=g{x) 
f{2n,x) = Sho{n,x){f{n,x)), provided n / 
/(2n+ = Sh^(n,x){f{n,x)) 

Clote then shows that, for example, the log-space functions are exactly the closure of un- 
der composition, concatenation recursion on notation, and sharply-bounded recursion on notation 
(called doubly-bounded recursion on notation by Clote and Takeuti |3j). This latter scheme defines 
a function / in terms of given functions g, /iq, /ii, and b by 

/(0,x) =5(x) 
/(2n, x) = hQ{n, x, f{n, x)), provided n ^ 
/(2n +l,x) = hi{n, x, f{n, x)) 
f{n,x) < |6(n,f)| 

It is easy to see that the scheme of weak bounded recursion on notation preserves the property 
that for sufficiently large n and x < n, f{x) < 2""-" . Thus, if the techniques of this paper 
could be extended to handle concatenation recursion on notation (for which |/(x)| may now grow 
linearly in |n|), one could hope to prove some version of the weak pigeonhole principle for these 
small complexity classes. 
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